THE ROCKET PROBLEM IN GENERAL RELATIVITY 



PEDRO G. HENRIQUES AND JOSE NATARIO 

Abstract. We derive the covariant optimality conditions for rocket trajecto- 
ries in general relativity, with and without a bound on the magnitude of the 
proper acceleration. The resulting theory is then applied to solve two specific 
problems: the minimum fuel consumption transfer between two galaxies in 
a FLRW model, and between two stable circular orbits in the Schwarzschild 
spacetime. 



1. Introduction 

The theory of optimal rocket trajectories in the context of Newtonian mechanics 
is beautifully developed in the classical reference P]. In this paper, we generalize 
this theory to the general relativity setting. We find that it is possible to formulate 
the theory in a fully covariant fashion, with all relevant auxiliary variables having 
geometric meaning. 

While it can be argued that this generalization is of little practical interest (we 
are quite far from being able to build a rocket capable of moving at relativistic 
speeds!), it turns out that the geometric perspective of general relativity brings 
fresh insights into this classical application of control theory - for instance, the 
relation of the primer equation with the Jacobi equation (Theorem 14. or the fact 
that ignorable coordinates restrict variations to geodesies with the same Killing 
conserved quantities (Theorem 18. ip . Moreover, this theory can be used to address 
theoretical issues in general relativity [? . 

The structure of the paper is as follows. In Section [2l we derive the well known 
rocket equation in general relativity, relating the initial and final rest masses of a 
rocket following a given spacetime trajectory. This is done mainly for the reader's 
convenience, as well as to fix the notation. In Section |31 we recall the details of 
Mayer problem in optimal control theory, which we use in Section 2] to derive the 
optimality conditions under the assumption that the magnitude of the proper accel- 
eration is bounded. These conditions are essentially a covariant differential equation 
for an auxiliary vector field defined along the trajectory, called the primer, whose 
direction gives the direction of the acceleration, and whose magnitude determines 
whether the rocket is undergoing maximum acceleration, intermediate acceleration, 
or free-falling. In Section [Sj we examine the behavior of the optimal trajectories 
as we let the bound on the acceleration tend to infinity. We find that the maxi- 
mum acceleration arcs collapse to instantaneous accelerations, whose locations and 
directions are again determined by the primer. In Section [6l we study the general 
1-dimensional problem, which we then particularize in Section [7] to the problem of 
finding the minimum fuel consumption transfer between two galaxies in a FLRW 
model. Besides the general theory, we use a Lorentzian version of the Gauss-Bonnet 
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Theorem to find these optimal trajectories, under the assumption that the expan- 
sion of the universe is either accelerating or non-accelerating. In Section[8l we study 
the role of ignorable coordinates, which we use in Section IH] to solve the problem of 
finding the minimum fuel consumption transfer between two stable circular orbits 
in the Schwarzschild spacetime. 

We use the conventions of [3 , including geometrized units (in which the speed 
of light and Newton's gravitational constant are set equal to 1) and the Einstein 
summation convention; when using the latter, Greek letters will always represent 
spacetime indices. 

2. Rocket Equation in General Relativity 

To study a relativistic rocket moving in d space dimensions, we must consider a 
(d+l)-dimensional spacetime (M, g), that is, a smoothQ (d-l-l)-dimensional manifold 
M endowed with a Lorentzian metric g. We assume (M, g) to be time-orientable, 
with a definite choice of time orientation. If the rocket's size is sufficiently small (as 
compared to the local radius of curvature), so that it can be modelled as a point 
particle, then its spacetime trajectory is given by a smooth future-directed timelike 
curve c : [to,ti] C M — > M, which we assume to be parameterized by the proper 
time r (that is, the spacetime velocity c is a unit vector, g{c,c) = — 1). Since the 
rocket must eject particles to accelerate, its rest mass will not be constant, but will 
be given instead by a smooth function m : [ro,Ti] K"*". For the same reason, the 
rocket's energy- momentum vector mc will not be conserved, and so we will have 

(1) Vc(mc)-fJ = 0, 

where V is the Levi-Civita connection of g and J is a vector field along c representing 
the instantaneous rate (with respect to proper time) at which energy-momentum is 
being carried away by the exhaust. The only a priori restriction to J is that it must 
be timelike or null and future-directed, corresponding to the physical requirement 
that the speed of the exhaust particles must not exceed the speed of light. Rewriting 
© as 

(2) J = —rhc — mVcC, 

and noting that the covariant acceleration VcC is orthogonal to the unit tangent 
vector c (hence spacelike), we see that the a priori restriction on J requires that 
TO < (so that J is future-directed), that is, the rocket's rest mass is a nonincreasing 
function. Moreover, we must have J = whenever to = (because J cannot be 
spacelike), and so 

(3) J = -m{c + V) 

for some spacelike vector field V orthogonal to c satisfying g(V, V) < 1 (so that J 
is timelike or null). Note that the norm \V\ := \g{V, V)\^ can be interpreted as the 
instantaneous speed of the exhaust particles as measured by the rocket. Comparing 
^ and (O, we have 

(4) rnVcC = mV ^ m\V\ = -m\Vcc\ 

(the minus sign coming from the fact that m is nonincreasing). 

In this paper, we will be interested in the case where speed of the exhaust 
particles is constant, that is, |y| = t; for some constant v satisfying < v < 1 (the 
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extreme case v — 1 corresponding to the exhaust particles moving at the speed of 
hght, that is, a photon rocket). Integrating Q we then obtain the following result 
(see also H [H 1 [3 1 1 [TO] ) . 

Theorem 2.1. For the relativistic rocket model specified above, the final rest mass 
mi := m(ri) is related to its initial rest mass toq ■= ™('''o) by the formula 

(5) mi — mo exp [ — / iVc c Idr] . 

V vJn, J 

To minimize the fuel consumption, one must then minimize the action deter- 
mined by the second order Lagrangian L := |Vcc|, which is not differentiable 
along its zero level set. For this reason, the standard Euler-Lagrange equations 
cannot be used directly, and one must turn to optimal control theory. 



3. Mayer Problem 

In this section, we briefly review the Mayer problem in optimal control. For 
more details see [I] [TT] and references therein. 

The setup of the Mayer problem is as follows: on a compact interval [to,ti] we 
consider n continuous, piecewise smootlQ functions : [to,ti] — > M, called the 
state variables, and m piecewise smooth functions : [tq,ti] M, called the 
control variables. Each choice of the control variables completely determines the 
state variables via the Cauchy problem 

iHr) = r{x\T), . . . , x"(r), ui(r), . . . , «"(t), r), 

(where the functions /* are smootljl). We require the control variables to satisfy 
the p < m independenlQ constraint equations 

/(xi(r),...,x"(r),ui(r),---,"™W,T)=0 

for each r G [to,ti], and the state variables to satisfy the endpoint conditions 

for i ^ 1, . . . , q < n. The Mayer problem consists in choosing the control variables so 
that J{x'l^^ , . . . , a;") is minimized, where J : R"^^ — > R is a given smooth function 
and x\ := x^{ti) for i = q + 1, . . . ,n. The problem can be slightly generalized by 
letting Ti be variable, in which case J may also depend on ti . 

The solution of the Mayer problem is obtained from the Pontryagin Maximum 
Principle as follows: we introduce n additional variables pi, called the momenta, 
and define the Hamiltonian 

H{x^,...,x",pi,...,pn,T) := max pj/*(x\ . . . , x", u\ . . . , u™, r), 

[u^ ,. . . ) 



We take piecewise smootli functions on [tq , ti ] to be given by restrictions of smooth functions 
defined on M to the subintervals of a partition of [tq, ti]; in particular, piecewise smooth functions 
and all their derivatives have one-sided limits at all points. 

■^It is possible to formulate the Mayer problem with much lower regularity; here, for simplicity, 
we consider only (piecewise) smooth functions, as is usual in differential geometry. 

^That is, the Jacobian matrix has rank p. 
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where the maximum is taken over all satisfying the constraint equations at time 
T. Then J is minimized for state variables solving Hamilton's equations 

I V - 

L^' ~ dx' ' 

with the initial conditions Piiro) chosen such that a;'(Ti) = x\ for i — 1, . . . ^ q and 

dj 

for i = q + 1, . . . ,n. If ti is allowed to vary, then its value can be determined from 
the additional endpoint condition 

Computing the Hamiltonian is a constrained maximization problem, and leads 
to the conditions 

where F is the Lagrange expression 

. . . , a;",pi, . . . ,p„,m\ . . . , u", /xi, . . . ,^p,t) 

P^f{x\ ...,x",u\...,u"',t)- fik9Hx\ . . . , X", w\ . . . , U™, t). 

These conditions, together with the constraint equations, yield and fik as func- 
tions of the variables {x^, . . . , a;",pi, . . . ,p„, r), thereby introducing p additional 
piecewise smooth functions /i^ : [to,ti] M, which are called the Lagrange mul- 
tipliersQ 

It is possible to use the Lagrange expression to obtain a shortcut for writing 
Hamilton's equations. Indeed, if the constraints are satisfied, then 

dg'' dg^ du^ _ ^ 
dx^ du^ dx'- ' 



and hence 



dH _ dp dp dui _ dp dg'' du^ 
dx'- dx' dui dx' dx' dui dx' 
dp dg'' _ dF 
^' dx' dx' dx' 
Therefore, Hamilton's equations can be written as 

\Pi = --g^i 

without the need to explicitly compute the Hamiltonian H . 



The multiplier /ij, gives the rate of change of the Hamiltonian as the value of the constraint 
is varied. 
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4. Optimality Conditions 

We now set up the problem of finding minimum fuel consumption trajectories 
as a Mayer problem. For that, we choose local coordinates . . . jX'^) on our 

spacetime manifold M . In this coordinate system, the metric is given by some 
matrix g^i, = g^y(xP ^ . . . and the Levi-Civita connection is determined by 
the Christoffel symbols F^^ = F^^(a::*', x^, . . . , a;''), which can be computed from 
the metric by the formula 

(6) = ig^"- [do^gpu + dpgo.. - a.5a/3) , {g^ := (9^.)"^ 

If the rocket's trajectory c : [To,ri] — )■ M is given in these local coordinates by 
cc^ = a;^(r), then its spacetime velocity U := c has components If^ = i'^, and its 
covariant acceleration A := VcC = V[/C/ has components 

(7) A" = + r'^i^u"u^. 

As we have seen, the covariant acceleration A is orthogonal to the spacetime velocity 
U, hence spacelike. Consequently, A = aN, where a :— \ A\ and is a unit spacelike 
vector (that is, g{N,N) — 1). The reasonable assumption is that the rocket is able 
to control the direction and intensity of its exhaust, hence N and a. We therefore 
choose a and the components N'^ of N as our control variables. Moreover, it is 
physically natural (and, as it turns out, mathematically desirable) to assume an 
upper bound a on the acceleration that the rocket is able to achieve. This leads 
to the constraint a < a, which we will implement by introducing an extra control 
variable a. Therefore, we have d + 3 control variables (a, A^°, N^, . . . , N'^, a). 

As state variables, describing the rocket's trajectory, we must then choose the 
rocket's spacetime coordinates and the components of its spacetime velocity. 
Moreover, we introduce an additional variable / (called the cost), which we will 
use to measure the fuel consumption. Therefore, we have 2d + 3 state variables 
{x°,x^,...,x'',U°,U'^,..., /). From ©, the evolution of the first 2d + 2 state 
variables will be given by the equations 

and we choose 

i = a. 

By ([5]) , the final mass of the rocket is will then be 

mi = mo exp 

where Ii := I{ti). Therefore, minimizing the fuel consumption is equivalent to 
minimizing the final cost, J = Ii. We will, however, keep J general, as it makes 
little difference in the minimum conditions. 

The fact that we want to use the proper time r as the time parameter must be 
specified through the constraint equation 

(8) g^.M''N'' = 0. 

This guarantees that A = WjjU is orthogonal to U, implying that the length of U 
is constant; we will assume the initial conditions to be chosen so that 

(9) gf^^Uf^U" = -1 




6 PEDRO G. HENRIQUES AND JOSE NATARIO 

(note that this is not an extra constrain10) . A further constraint is necessary to 
specify that a is the norm of the acceleration, or, equivalentfy, that is a unit 
vector, namefy 

(10) g^..N^N'' = 1. 

FinaUy, it is necessary to require that < a < a. These inequahties can be 
accommodated by the constraint equation 

a(a — a) = . 

We can now write the minimum conditions. The Lagrange expression is 

(11) (-r^^[/"t/'^ + aiV^) + p^+d+if/'^ 



' P2d+2 a 



- ^iig^^^U^'N'' - ^2 {g^^.N'^N'' - 1) - /i3 [a{a - a) 
Therefore, we have the Hamilton equations 

(12) <^ p^+a+i = -S& = Pjd^Tli.U'^U^ + d^g^p {fnU'^N^ + M2iV"A^) , 

[p2d+2 = -If = 0, 
and the Pontryagin conditions 



(13) 



8F 



0=1^= Pm^" + P2d+2 + M3(2a - a), 
0=€ = 2M3a. 



The last Pontryagin condition (1131) implies that either a = (in which case we must 
have a = or a = a) or /Lt3 = (in which case we may have < a < a). Therefore, 
the rocket's trajectory is naturally divided into zero acceleration (ZA) arcs (a = 0), 
maximal acceleration (MA) arcs (a = a) and intermediate acceleration (lA) arcs 
(0 < a < a)Q 

Assume first that a > 0. Then the first Pontryagin condition ([TS]), together with 
(©, dH) and (Uni), implies 

(14) P^. + {PcUn = pN^, 
where 

Since N has to be chosen so that Lagrange expression (1111) is maximized, we must 
have p > (because otherwise it would be possible to increase p, and so F, by 
reversing N). If the constraints are satisfied, the terms proportional to a in the 
Lagrange expression are 

(15) {p + p2d+2)a =■■ ka. 



cannot use ^ directly as the constraint because its Jacobian matrix, with respect to the 
control variables, vanishes. 

^If we are minimizing the fuel consumption, then the lA arcs, where the acceleration is not 
constrained, must be solutions of the Euler-Lagrange equations for the second order Lagrangian 

L{x°,...,x'^,x°,...,x'',x°,...,x'') = [g^^ (xi" + r'^^x°'xl^^ (i" + TJ^^iTa 
subject to the constraint g^^i^i'' = —1. 
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Since again the choice of a has to maximize the Lagrange expression, we must 
have A; > on a MA arc (because otherwise it would be possible to increase F by 
decreasing o) and A: = on a lA arc (because otherwise it would be possible to 
increase F by either increasing or decreasing a). 

On a ZA arc, the control variables can be chosen arbitrarily (as long as they 
satisfy the constraint equations). Therefore, we keep the choice 

p 

P 

whenever p 0, where 

■.= P^ + {Pc,U'')U^ 

are the components of the projection of the vector p on the hyperplane orthogonal 
to U and 

With this choice, ([HI) and (|15l) still hold, and so maximizing the Lagrange expres- 
sion with respect to to a requires fc < on a ZA arc (because otherwise it would 
be possible to increase F by increasing a). 

The Lagrange multipliers pi and fi2 can be determined by contracting the first 
Pontryagin condition ([T3| with U and N, which yields 

(16) ap^C/^ + /ii - 
and 

(17) ap = 2p2- 

The first Hamilton equation can be rewritten as 

(18) "^uP,.^ -qf.+ PiN^,, 
where 

(19) Qt, ■■= Pt,+d+i - Tl^p-yW^- 
The second Hamilton equation (fT2|) then yields 

(20) 

^Pf^+d+i - d^Ti^p.u^u^ - ri^p^u" - ri^p.u" - n^.q^v'^ 

+ d^g^p {piU"N^ + P2N^nP) - piTl^N^U^ - piTl^U^N" ~ 2p2Tl^N,N", 
where we have used ((HI), (|16l) and (IT71) to write 

ap-y — —a [psU^) U-y + apN-y — piU-y + 2p2Nj. 
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Now, using dl]), we have 

= d^ga.pU'^N^ - \d^g^p (f/"iV^ + 7V"[/^) = 0, 
and, similarly, 

d^go^^N^N^ - 2Tl^N,N'^ 

= d^ga^pN'^N^ - g-^f" {d^gp^ + d^gp^ - Dpg^^) N^N'^ 
= {d^gai3 - dagpp. - d^gpa + dpgafj,) N"N'^ = 0. 

Therefore, the terms proportional to /ii and /i2 drop out of ((20|) . and we are left 
with 

It is convenient to write the Hamilton equations using the modified momentum 
variables P^. We have 

(21) WuP^ = {p^ + ip^U") U^) = WuPf. + fU^ + aN^ 
where we used (|18p and and defined 

This function can be determined from the Hamiltonian first integral 
H^Pt. ['KpU'^U^ + aTV^) + p^+d+iW^ + P2d+2 a, 
which, in view of (|15p and (IT9| . can be written as 

H = q^Uf" + ka. 
In fact, if we contract (|211) with U, we obtain 

/ = - (Vc/Pp) [/^ - q^U^" = Pp (Vc/?/'') -H + ka = pa + ka-H. 
If Ti is not specified, then 

dri 

The remaining endpoint conditions depend on the exact nature of the problem. The 
standard problem of minimizing the fuel consumption, as we have seen, corresponds 
to J = /i . In this case, the values of all state variables except / are fixed at t — ti, 
and we have the additional endpoint condition 

dJ 

oh 

Moreover, if ti is not fixed, then iJ = 0. 

Following [T], we shall call the spacelike vector field P and the function k the 
primer and the switching function of the trajectory. Like all variables in our 
problem, they are piecewise smooth. Moreover, it is known from the general theory 
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of the Mayer problem that both the momentum variables and the Hamiltonian 
must be continuous along the optimal trajectory. Therefore, both the primer and 
the switching function are continuous. The continuity of H then implies that k 
must vanish at the discontinuities of a. 

To summarize, we have proved the following result. 

Theorem 4.1. The optimal motion oj the rocket is obtained from the motion equa- 
tion 

by choosing 
where 

and the primer satisfies 
(22) 
and 



P 



^uPf. = + (pa + ka- H)U^,, 



Pt.U'' = 0. 

Here, the switching function k := p + P2d+2, with p2d+2 constant, satisfies k > on 
MA arcs, k = on lA arcs and k < on ZA arcs, and the Hamiltonian integration 
constant H = qf^U^ + ka is given by 

OTl 

if Ti is not fixed. Moreover, P2d+2 = ^1 for the fuel minimizing problem. 

5. Instantaneous Accelerations 

The problem is simplified if we take the limit a — > +oo, corresponding to situ- 
ations where the duration of the maximum acceleration periods is negligible. We 
assume that this limit exists and that all functions converge to piecewise smooth 
functions. In this limit, the MA arcs collapse to points where we have instantaneous 
(Dirac delta) accelerations At such points we have: 

(a) x^^ is continuous, because — is at most discontinuous; 

(b) is continuous, because '^uqfj. = Rfj.a/SjU'^P^U'^ is at most discontinuous; 

(c) and P^ have a discontinuity given by a (positive) boost in their plane, 
because P^JJ^ = 0, VjjU^ is a positive multiple of P^, and, in the limit, VjjP^^ 
is a positive multiple of (in particular, p, and hence k, are continuous); 

(d) fc = 0, because this always happens at the endpoints of a MA arc. 

Both k and p attain their maximum at the instantaneous acceleration, where we 
must therefore have p > Q. Statement (jcj can then be written as 



(23) 



U^f = [/^ coshw + ^P^ sinh u, 
: pU^ sinh u + Pf cosh u, 



^See | 12| for a general framework dealing with impulsive controls. 
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for some u > 0, where the indices — and + denote the limits at the instantaneous 
acceleration from the left and fr-om the right. Since fca = on both ZA and lA 
arcs, the Hamiltonian integration constant yields 

which, together with ()23|) . imply 

coshw — 1 

Since by (22) 
we have 

. ,,cosh7i — 1 

24 p_ = H^- = 

smhw 

that is, p reverses its sign at the instantaneous acceleration. 

If H = Q (as is the case if ti is not specified and J does not depend on ti), then 
p4l) implies that p is continuously differentiable with p = at an instantaneous 
acceleration^ In fact, in this case, p is actually a function: using fca = on ZA 
and I A arcs, whence q^j.U'^' — 0, and P^t/'' = 0, we have 

P^+PP= {VuPn (VuP^) + p"- (yuyuP^.) 

= i-q^ + paU^") i-q^ + paU^) + P'' {-Vuq'' + paVuU^) 

= q^q^ - p^a^ - P^' R^^p^U'' P'^U'' + p\^ = q^q,, + p^K, 

where K := —Rap^sU"' N^U^ is the sectional curvature of the plane spanned by 
U and P, and so p is continuous at an instantaneous acceleration. 
We collect all these results in the following statement. 

Theorem 5.1. At an instantaneous acceleration we have: 

(a) U and P have a discontinuity given by a (positive) boost in their plane; 

(b) p reaches its maximum and is continuous; 

(c) p reverses sign; 

(d) if H — 0, then p is a function. 

6. 1-DlMENSIONAL PROBLEMS 

We now consider the simplest case d = 1, corresponding to problems where the 
rocket moves along a line. The spacetime (Af , g) is thus 2-dimcnsional. 
Because p is constant on lA arcs, we have from (|22l) 

P^g^ = -P^Vc/P^ = -pp = 0. 

Since {U,N} is an orthonormal frame and qf^U^ ~ H, we must have 

and hence, from the Hamilton equations (|22|) . 

-HaN^, = R^^fi^U°'Pf^U^ ^Ha = pK 

(where K is the sectional curvature). It immediately follows that, if if = (as will 
be the case if ri is not specified and J does not depend on ri), then lA arcs can 



''Note that this does not have to be true for instantaneous accelerations at the endpoints of 
the optimal trajectory. 
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only exist in a fiat spacetime. Even if 7^ 0, I A arcs can only exist if the curvature 
has the same sign as H. 

On ZA arcs, on the other hand, we have WjjU = 0, i.e. U is parallel transported. 
Since {U, N} is an orthonormal frame, it follows that N is also parallel transported. 
Using P — pN, the Hamilton equations (|22p reduce to 

pN^ = -R^o.pjWP^U'' ^p = Kp. 

In summary, we have the following result. 

Theorem 6.1. If{M,g) is 2-dimensional, then: 

(a) Ha = pK on I A arcs; 
(h) p = Kp on ZA arcs. 

7. FLRW Models 

Recall that the Friedmann-Lemaitre- Robertson- Walker (FLRW) models are given 
in spherical coordinates {t, r, 9, ip) by metrics of the form 

ds^ = -df + [^^2 _^ 5.^2^^) f^^Q2 _^ gij^2 g,^^2^j ^ 

where 2]o(r) := r (flat universe), Si(r) := sinr (spherical universe) and I]_i(r) := 
sinhr (hyperbolic universe). These metrics describe spatially homogeneous and 
isotropic universes, and are believed to provide a good model for the large scale 
structure of our own universe. The matter content of these universes is usually 
taken to be a pressureless perfect fluid of uniform density p, whose elements are 
fixed at constant spatial coordinates, and are interpreted as (the averaged version 
of) galaxies (or clusters of galaxies). The Einstein field equations then determine 
the time evolution of the scale factor R{t) and the matter density p{t), according 
to the Fricdmann equations 

' _2E A k 
R^~R^^3^R^' 

< 

3E 
^ A-nR^ 

(where > is a constant determining the matter density and A is the cosmological 
constant). The flat Minkowski spacetime is a particular case, corresponding to 
k = E^ K^O. 

We consider the problem of transfer between two galaxies in a FLRW universe. 
Assuming the first galaxy to be placed at r = 0, it is easily seen that the motion 
must occur along the line of constant {9, ip) joining the two galaxies, and so this is 
naturally a 1-dimensional problem. Our spacetime will then be the totally geodesic 
2-dimensional submanifold of constant {9,tp), with metric 

ds^ = -df ^ R^{t)dr^. 

Choosing the natural orthonormal coframe 

a;° := dt, := R{t)dr, 

one readily obtains from the first Cartan structure equations, 

dw" + w^fl A w*^ = 0, 
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the nonvanishing connection forms 

and hence the nonvanishing curvature forms 

R{t) 

Consequently, the sectional curvature is 

Assume that the rocket aims at minimizing the fuel necessary to take it from 
being at rest in the first galaxy (placed at r = 0) at time t ^ to to being at rest in 
the second galaxy (placed at r = ri) at time t = ii0 For this problem, H = 0, 
and consequently there are no lA arcs ii K for to < t < ti. 

If K{t) > (that is, if the expansion is accelerating) for to < t < ti, then, 
from Theorem 16.11 we have p = K p > for p ^ 0, and so all critical points 
of p are necessarily minima. We conclude that there cannot exist instantaneous 
accelerations except at the endpoints of the optimal trajectory, which must then 
be a ZA arc with two instantaneous accelerations at the endpoints. 

The same conclusion holds if K{t) < (that is, if the expansion is not acceler- 
ating) for to <t<ti, but the argument is more complicated. We start by proving 
a Lorentzian version of the Gauss-Bonnet Theorem suited to our problem. To do 
so, we set 

and consider a second orthonormal frame 

-Fo cosh uEo + sinh uEi^ Fi := sinh uEo + cosh u Ei 

(where u is the locally defined hyperbolic angle from Eq to Fo). If c : [tq, ti] — M 
is a timelike curve parameterized by its proper time, then from 

one readily obtains 

VtFo^{u + uj\{c))Fi. 
If Fo is parallel transported along c, then we have 

ii = -w°i(c), 

and hence 

(25) u{c{ti))-u{c{to)) = ~ j^uj\. 

If we now write 

c(r) = coshw(r) Fq + sinh w(r) Fi 
(so that the total hyperbolic angle from _Bo to c is u 4- w), then we have 

VcC — w{s\nhw Fo + coshwFi), 

and, consequently, 

IVcCl = \w\ . 
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We assume (ti,ri) to be in the chronological future of (io,0). 
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The function w{t) is called the geodesic curvature of c. The cost of c is 



(26) 



|w| dr > \w{ti) - w{to)\ , 



with equality if and only if ib does not change sign. 



En 



En 



Eo 




En 



Figure 1. Timelike curves tangent to Eq at the initial or final endpoints. 



Now let c+ and be timelike curves with common endpoints, with c+ to the 
future of c^. Assume the endpoints to be the only intersections and let A C M 
be the region bounded by the two curves. If the curves are tangent to Eq at the 
initial endpoint (where we choose = Fq = Eq), then we must have, at the final 
endpoint, 

+ > + 
(cf. Figure [1]). Now, from we have 

U+-U- = - [ L0\+ [ L0\= [ CU\ = [ dL0\ = [ n\= [ K<0, 
Jc+ Jc- JdA J A Ja J a 

implying that 

(27) w+>w~. 

Similarly, if the curves are tangent to Eq at the final endpoint (where we choose 
Fq — Fq = Eq), then we must have, at the initial endpoint, 

w~ + u~ > + u'^ 

(cf. Figure [T]). Again, from we have 



IdA J A J A J A 
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implying that 
(28) 



< w 




Figure 2. Timelike curves connecting {to,0) and {ti,ri). 

Finally, let 7 be the geodesic connecting the events (to,0) and {ti,ri), regarded 
as tangent to Eq at the two endpoints via two instantaneous accelerations. Let c be 
any smooth timelike curve with the same endpoints, also tangent to Eq at the two 
endpoints. Then c is to the future of 7 near (to, 0), 7 is to the future of c near {ti , ri) 
and the two curves intersect at least once besides the endpoints (cf. Figure [2]) . If 
we take c as c"*" and 7 as c~ between the first endpoint and the first intersection 
(besides the endpoints), then we have from (|27)) 

— 0| = < < \u}^ — 0| . 

Since the sign of w~ does not change, we see from ((26| that the cost of c is greater 
or equal than the cost of 7 between these two points. Analogously, if we take 7 
as c"*" and c as c~ between final intersection (besides the endpoints) and the final 
endpoint, then we have from (f28)) 

|0 — I = vS^ < < |0 — I . 

Since the sign of does not change, we see from ([26l) that the cost of c is greater 
or equal than the cost of 7 between these two points. Since the cost of 7 between 
the first and the last intersections is zero, we conclude that the total cost of c is 
greater or equal than the total cost of 7. 

In fact, it is not difficult to see that the cost of c is strictly greater than the 
cost of 7: if c is tangent to 7 at any intersection (besides the endpoints), then its 
geodesic curvature must change signs. Therefore, we have the following result. 
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Theorem 7.1. If R{t) > or R{t) < for to < t < ti, then the minimum fuel 
consumption trajectory for the transfer of a rocket from being at rest with respect 
to the galaxy r — at time t = to to being at rest with respect to the galaxy r ~ ri 
at time t = ti is a ZA arc with instantaneous accelerations at the endpoints. 

Note that the conditions R{t) > and R{t) < are really the conditions K > Q 
and K < 0, and so this theorem is easily applicable to other l-dimensional problems. 

8. IGNORABLE COORDINATES 

We say that a given coordinate a;"' is ignorable iff the metric does not depend 
on x'^ and the final value is not specified. Under certain circumstances, the 
existence of ignorable coordinates greatly simplifies the minimum conditions for 
ZA arcs. 

Assume that H ~ {a.s is the case if n is not specified and J does not depend 
on Ti). In this case, the minimum equations on a ZA arc reduce to the geodesic 
and Jacobi (geodesic deviation) equations, 

\VuVuP^. = -R^,o.MU'^P^U\ 
and the Hamiltonian first integral becomes 

(Vr/Pj C/^ = 0, 

which is just the proper time derivative of the constraint P^U^ — 0. Since the 
metric does not depend on the ignorable coordinate x'^, the covariant component 
Ua is conserved. Moreover, the Hamiltonian is then independent of a;"^, and hence 
Pa+d+i is also constant. Since x" is not fixed, we have Pa+d+i = 0, that is. 

On the other hand, we have 

implying that 
and, consequently, 

(29) WuPa - n.P^C/" = 0. 

We can obtain a geometrical interpretation of this condition by rewriting ((29)) as 

(30) X^'VuP^. - i^aX^) P^U'' = 0, 
where 

dx'^ 

Thinking of P as a Jacobi field, hence satisfying 

[U, P] = ^ Vc/P'' - Vpf/'' = 0, 
and using the fact the X is a Killing field, 

V^X, + V.X^ = 0, 

we can write (P^]) as 

Xf'VpU^ + U^VpX^ ^O^Vp {U^X'^) = 0. 
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Therefore, ([29l) is the condition that the Jacobi field connects infinitesimahy 
nearby geodesies with the same value of the conserved quantity Uf_iX'^. In other 
words, we have the following result. 

Theorem 8.1. If H — and is an ignorable coordinate, then the Jacobi field 
P connects infinitesimally nearby geodesies with the same value of the conserved 
quantity Ua- 



9. SCHWARZSCHILD SOLUTION 

Recall that the Schwarzschild solution is the unique spherically symmetric solu- 
tion of the vacuum Einstein field equations, given in spherical coordinates (t, r, 9, ip) 
by the metric 

-1 



ds' = -\ l- 



2M 



dt^ 



1 - 



2M 



dr^ 



sin^ Odp'^ 



It represents the gravitational field created by a spherically symmetric body of 
mass M (which we assume to be positive), and so models the vicinity of an isolated 
planet or star (or even a more exotic object, like a neutron star or a black hole). 

We consider the problem of finding the most fuel-efficient transfer between two 
stable circular orbits in the Schwarzschild metric. For simplicity, we restrict our- 
selves to motions the (2 -|- l)-dimensional totally geodesic submanifold 6 = ^, 
corresponding to the equatorial plane, whose metric is 



ds' 



1 



2M 



dt^ 



1 



dr^ 



r'^dip^. 



We will not fix ri (hence H — Q) , ti or ipi (and so t and ip are ignorable coordinates). 
Moreover, the choice J = Ii implies k = p — 1. The geodesic equations in the 
coordinate system above can be written as 

.. 2M ( 2M\^^ . / 2M\ ■ 



(31) 



r I 



M 



1 - 



2M 



t"- - 



M 



r I 



(p H — rip = <;4> r ip = L, 
r 

where E and L are integration constants. Equation for x'^ = t can be obtained 
by linearizing the first geodesic equation (j3ip while keeping E fixed: 

(32) {^-'-f)p'+'-^P^i-O^P*+'JiP.^O. 

Similarly, equation ([29]) for x" = if can be obtained by linearizing the third geodesic 
equation pip while keeping L fixed: 



(33) 



,2p>p ^ 2rP''ip = ^ f"^ 



2M 



(pPr = 0. 



Since Ut and Utp are conserved, the proper time derivative of the constraint 

P'Ut + P^U^ + PrU"- = 
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d 



P'Ut+F^U^ + — {Prf)^Q. 

ar 

Using dsn and ^ yields 

1 )t^P,-2r{l U2p^ + (p,,^) = 0, 

and from the radial component of the geodesic equations (PT|) we have 

2M ( ^ 2Af\"\ 



(34) - 2rPr + ^ 1 r^Pr + PrV + Pr'r = 0. 

For non-circular orbits, this equation is readily solved: 



^ 2M\ ^ 



Pr = Ar\\- —j ^ P' = Ar, 

where A G M is an integration constant. Substituting on (1321) and using the first 
geodesic equation (PT|) yields 

P* = At ^ P^ = Ai + B, 

where i? G R is another integration constant. Similarly, substituting on (p3| and 
using the third geodesic equation pip yields 



P"^ ^ Aifi P'^ = Acp + C, 

where C G R is yet another integration constant. Therefore, P^ is a linear combi- 
nation of three obvious Jacobi fields: 

d d 
P^AU + B- + C—. 

at dip 

The condition P^f/^ = imposes a restriction on the integration constants, namely 

-A + BUt + CU^ = 0. 

We then see that 

/ 2M\ 

p2 ^ p^pt, ^ _ ii_ _ \ p2 ^ ^2^2 ^ 2ABUt + 2ACU^ 
= A^+(—-l)B^+r^C\ 



r 



and, consequently. 



. , / 2M , , 

(35) 2pp= i-—B^ + 2rC^ 

Thus p can only be zero if r = or C^r^ = MB"^ . This second possibility, however, 
corresponds to a minimum of p^, and hence cannot happen at an instantaneous 
acceleration or transition to an lA arc. We conclude that the optimal trajectory 
can only enter or leave a non-circular orbit at a turning point, that is, a point where 
r = 0. At these points, we will have P^ = Ar — 0, meaning that the acceleration 
will be tangential to the orbit. 
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For circular orbits, p4p does not yield any information, and we have to write 
the radial component of the Jacobi equation. A shortcut to do so is the following: 
using the normalization condition 



r + r if = — 1 , 

r J ' " ' 

we can recast the radial component of the geodesic equations ([31]) in the form 

M f 3M\ L"^ 
(36) - + j7F = 0- 

Linearizing the equation above while keeping L constant yields 



2 



(37) P'' + (^1 - — j -^P" = 0> 

where we used (j36p to write 

\ r J r 

on a circular orbit. From 

P^t/^ = ^ P^Ut + P'^C/^ = ^ P* = -^pv 



and 



we have 



Therefore 



Ut (l-2M)t 1 2Af 



1 _ 2M ^ • 



2M\ Mr , 2M^ ^ 



= - ( 1 - ^ j ip')' + ( 1 - — ) i^"^)^ + ipn' 



2M^ ^ 



y (iprf+(l-^-M.)r^P^ 



r / \ \ r 



and hence 



PP 



fp^p^+ri-^^ 



pip 

r J \ \ r J 

1--) ^'(^^-(i--)^v)' 

where we used ((55)). So, for p = 1, we will have p = if P*" = 0, or 
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In this second case we will have 



r I \ \ r I r 



ot2 

(39) (P'f, 

which is strictly positive unless — 0. We conclude that the optimal trajectory 
can only enter or leave a circular orbit at points where = 0, meaning that the 
acceleration will be tangential to the orbit. 

We now show that the optimal trajectory cannot contain lA arcs. Let us start 
by showing that, if a lA arc exists, then none of its endpoints can be a non-circular 
ZA arc. Since p is a function and p = 1 on a lA arc, then we must have p = 
at ZA arc's endpoint. From ([55]) and the fact that p = f = at the endpoint, we 
obtain 

Now, r never vanishes at a turning point, and the expression in brackets vanishes 
only at the minimum of p. We conclude that p cannot vanish at the endpoint, and 
hence a I A arc cannot have an endpoint on a noncircular ZA arc. 

For circular orbits, the story is more complicated. Using the formula 



r I \ \ r I r ) r' 



obtained while deducing ([39|), we have 



Since on a transition point to a I A arc we must have p = \ and p = p = P^ = 0, 
the formula above requires that either = or ([35]) holds. Differentiating again 
yields 



3pp + pP = (l-— 1 pr\pr-U-—\P--\^—P^P^ 



T 1 \ T J 7*^ \ \ T 1 T 1 



pr pr 



which shows that p = at the transition point. Finally, differentiating once more 
and assuming that (p8| holds, we have 



P = .4 



showing that, if P^ ^ 0, then p would have a local minimum at the transition point. 
But that is impossible, since p < 1 on the circular orbit; therefore, we must have 
P*" = = at the transition point. Since P^ satisfies the second-order linear 
ODE ((37)) . we conclude that P^ must vanish identically on the circular orbit, and 
hence P'^ must be a linear combination of the Killing vector fields ^ and ^ . These 
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fields are orthogonal and have constant norm over the circular orbit, and hence we 
will have p = 1 on the circular orbit. 

We then see that a lA arc can only join circular orbits where p = 1. The same 
reasoning as for lA arcs shows that these circular orbits cannot have an endpoint 
on a non-circular orbit. We conclude that, if the optimal trajectory contains a 
lA arc, then it must consist solely of lA arcs, circular orbits and instantaneous 
accelerations, and, moreover, we must have p = 1 throughout the trajectory. 

On the other hand, if the optimal trajectory contains a stable circular orbit, then 
it is always possible to add a non-circular ZA arc. Indeed, if the trajectory departs 
from the stable circular orbit via a lA arc, then we can interrupt this arc after a 
sufficiently small proper time interval to achieve a non-circular orbit. This orbit 
is periodic in r and will return to the same values of (r, f) after a certain proper 
time period, at which point the initial lA arc can be resumed (from different values 
of t and 95, but the final values of these variables are not fixed). If there is an 
instantaneous acceleration at the stable circular orbit's endpoint, then it can be 
broken into an initial small enough instantaneous acceleration, followed by a non- 
circular ZA arc, followed by the remaining instantaneous acceleration (again from a 
point with different values of t and ip). This has exactly the same cost as the original 
instantaneous acceleration, and so the resulting trajectory will also be optimal. We 
conclude that, if the optimal trajectory contains a stable circular orbit, then no lA 
arc can exist. 

Therefore we have proved the following result. 

Theorem 9.1. The optimal trajectory for the transfer of a rocket between two 
stable circular orbits in the Schwarzschild metric with minimum fuel consumption 
is composed of ZA arcs and instantaneous accelerations tangent to the ZA arcs, 
which must be at turning points for noncircular ZA arcs. 

The simplest of such trajectories is the relativistic analogue of the celebrated 
Hohmann transfer manoeuvre [T3j [Tj , consisting of a single noncircular ZA arc 
connecting the two stable circular orbits (cf. Figure [3]). 

10. Conclusions 

We have derived the covariant optimality conditions for rocket trajectories in 
general relativity, with and without a bound on the magnitude of the proper ac- 
celeration (Theorems 14.11 and I5.1|) . These conditions are essentially a covariant 
differential equation for an auxiliary vector field defined along the trajectory (the 
primer) , closely related to the Jacobi equation. We studied these conditions for gen- 
eral 1-dimensional problems (Theorem 16. II) . and then particularized to the problem 
of the optimal transfer between two galaxies in a FLRW model. We found that for 
generic classes of models (including the flat Minkowski spacetime) the optimal tra- 
jectory consists of an initial instantaneous acceleration, followed by free-fall motion 
and a final instantaneous acceleration (Theorem 17. ip . We showed that ignorable 
coordinates lead to great simplifications in the optimality conditions (Theorem l8.1l) , 
and used this to study the problem of optimal transfer between two stable circular 
orbits in the Schwarzschild spacetime. Here we found that the optimal trajecto- 
ries consist of free fall motions connected by tangential instantaneous accelerations 
(Theorem l9.1|) . including the relativistic analogue of the celebrated Hohmann trans- 
fer manoeuvre. 
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Instantaneous acceleration 




Figure 3. Hohmann transfer manoeuvre. 
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